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Abstract 

We consider a GI/GI/1 queue with the shortest remaining processing time discipline (SRPT) 
and light-tailed service times. Our interest is focused on the tail behavior of the sojourn-time 
distribution. We obtain a general expression for its large-deviations decay rate. The value of 
this decay rate critically depends on whether there is mass in the endpoint of the service-time 
distribution or not. An auxiliary priority queue, for which we obtain some new results, plays an 
important role in our analysis. We apply our SRPT-results to compare SRPT with FIFO from 
a large-deviations point of view. 
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1 Introduction 



In queueing theory the shortest remaining processing time (SRPT) discipline is famous, since it is 
known to minimize the mean queue length and sojourn time over all work-conserving disciplines, 
see for example Schrage |22J an d Baccelli & Bremaud Recent developments in communication 
networks have led to a renewed interest in queueing models with SRPT. For example, Harchol-Balter 
et al. |13j propose the usage of SRPT in web servers. An important issue in such applications is the 
performance of SRPT for customers with a given service time. Bansal & Harchol-Balter [3] give 
some evidence against the opinion that SRPT does not work well for large jobs. They base their 
arguments on mean- value analysis. Some interesting results on the mean sojourn time in heavy 
traffic were recently obtained by Bansal and Bansal & Gamarnik [5], who show that SRPT 
significantly outperforms FIFO if the system is in heavy traffic. 

In the present paper we approach SRPT from a large-deviations point of view. We investigate 
the probability of a long sojourn time, assuming that service times are light-tailed. For heavy-tailed 
(more precisely, regularly varying) service-time distributions, Nuhez-Queija ^H] has shown that the 
tail of the sojourn-time distribution ^{Vsrpt > %} and the tail of the service-time distribution 
¥{B > x} coincide up to a constant. This appealing property is shared by several other preemptive 
service disciplines, for example by Last-In-First-Out (LIFO), Foreground-Background (FB) and 
Processor Sharing (PS); see [7j for a survey. Non-preemptive service disciplines, like FIFO, are 
known to behave worse: the tail of the sojourn time behaves like x¥{B > x}. This is the worst 
possible case, since it coincides with the tail behavior of a residual busy period; for details see 
again [7]. 

For light-tailed service times the situation is reversed. In a fundamental paper, Ramanan & 
Stolyar [2U] showed that FIFO maximizes the decay rate (see Section |21 for a precise definition) 
of the sojourn-time distribution over all work-conserving service disciplines. Thus, from a large- 
deviations point of view, FIFO is optimal for light-tailed service-time distributions. Since for any 
work-conserving service discipline the sojourn time is bounded by a residual busy period, the decay 
rate of the residual busy period is again the worst possible. Recently, it has been shown that this 
worst-case decay-rate behavior of the sojourn time is exhibited under LIFO, FB |15j . and, under 
an additional assumption, PS [T7| . 

The present paper shows that a similar result holds for both non-preemptive and preemptive 
SRPT, under the assumption that the service-time distribution has no mass at its right endpoint. 
Thus, for many light-tailed service-time distributions, as for example phase-type service times, large 
sojourn times are much more likely under SRPT than under FIFO. The derivation of this result is 
based upon a simple probabilistic argument; see Section E~T1 
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The case where there is mass at the right endpoint of the service-time distribution may be 
considered to be a curiosity; however, from a theoretical point of view, it actually turns out to 
be the most interesting case. The associated analysis, carried out in Section 14,21 is based on a 
relation with a GI/GI/1 priority queue. Since we could not find large-deviations results in the 
literature (an in-depth treatment of the M/G/l priority queue is provided by Abate & Whitt lj), 
we analyze this GI/GI/1 priority queue in Section |31 Another noteworthy feature of this case is 
that the resulting decay rate is strictly larger than the one under LIFO, but strictly smaller than 
under FIFO (with the exception of deterministic service times, for which the FIFO decay rate is 
attained). A similar result was recently shown in Egorova et al. ^0] for the M/D/l PS queue. 
However, in general examples of service disciplines that exhibit this "in-between" behavior are rare; 
see Section 5.1 of this paper for an overview. 

Our results on SRPT suggest that, from a large-deviations point of view, it is not advisable 
to switch from FIFO to SRPT. However, in Section 6 we show that this suggestion should be 
handled with care. Specifically, we investigate the decay rate of the conditional sojourn time, i.e., 
the sojourn time of a customer with service time y. We show that there exists a critical service 
time y* such that SRPT is better than FIFO for service times below y* and worse for service times 
larger than y* . A performance indicator is the fraction of customers with service time exceeding 
y*. We show that this fraction is close to zero for both low and high loads; numerical experiments 
suggest that this fraction is still very small for moderate values of the load. 

This paper is organized as follows. Section [21 introduces notation and states some preliminary 
results. In particular, the decay rates of the workload and busy period are derived in complete 
generality. Section 3 treats a two-class priority queue with renewal input and investigates the tail 
behavior of the low-priority waiting time. The results on SRPT are presented in Section 0J Section 
treats various implications of the results in Sections |3] and First, we compare our results with 
the decay rates for LIFO and FIFO, and show that the decay rate of the sojourn time under SRPT 
is strictly in between these two if the service-time distribution has mass at its right endpoint. We 
then treat the special case of Poisson arrivals; in particular we show that our results for the priority 
queue agree with those of Abate & Whitt PQ. In addition, we consider the behavior of the decay 
rates in heavy traffic. Conditional sojourn times are investigated in Section H3 We summarize our 
results and propose directions for further research in Section [7| 

2 Preliminaries: workload and busy period 

In this section we introduce the notation and derive two preliminary results. We consider a station- 
ary, work-conserving GI/GI/1 queue, with the server working at unit speed. Generic inter-arrival 
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and service times are denoted by A and B. To avoid trivialities, we assume that ¥{B > A} > 
(otherwise there would be no delays). Define the system load p = K{B} /K{A} < 1. Since p < 1, 
the workload process is positive recurrent and the busy period P has finite mean. The moment 
generating function of a random variable X is denoted by &x{s) = V,{e sX }. Throughout the paper 
we assume that B is light-tailed, i.e., that &b(s) is finite in a neighborhood of 0. Let W be the 
workload seen by a customer upon arrival in steady state. This workload coincides with the FIFO 
waiting time. Furthermore, let W y be the steady-state workload on arrival epochs in the GI/GI/1 
queue with service times B y = BI{B < y). Let P y denote the busy period in such a queue. Our 
first preliminary result concerns the logarithmic tail asymptotics for W. 

Proposition 2.1 As x — > oo, we have that logP{W^ > x} ~ — j w x, with 

lw = sup{s : $ A (-s)$ B (s) < 1}. (2.1) 

We call 7^ the decay rate of W. Generally, for any random variable U, we call "y u the decay 
rate of U if for x — > oo, 

log ¥{U > x} = — 7„x + o(x). 

If $a(— 1w)^b{1w) = 1) several proofs of Proposition 12.11 are available, see e.g. Asmussen 
0, Ganesh et al. |llj and Glynn & Whitt [T5|. We believe that the result in its present general- 
ity is known as well, but could not find a reference. For completeness, a short proof is included here. 

Proof of Proposition 12.11 

The upper bound follows from a famous result of Kingman |14j : 
logP{PF > x} < - j w x. 

For the lower bound we use a truncation argument. From Theorem XIII. 5. 3 of |2] (the condition 
of that theorem is easily seen to be satisfied for bounded service times), it follows that 

log¥{W y > x} ~ --fix, 

with = sup{s : ®A(-s)®By(s) < 1}. Consequently, since W{W > x} > ¥{W y > x}, 

liminf-logP{PF > x} > -j y . 

x—*oo x 

Since <&bv{s) is increasing in y, and $b»(s) converges to $s(s) as y — > oo, the decay rate 7^ is 
decreasing in y, and converges to a limit 7^ > 7 W . Since 7^ £ [0, 7^] for any y, and < &a(— s)^By(s) is 
convex in s and has a negative derivative in 0, we have &a(— 1w)^bv{Hw) — 1 f° r an V- Consequently, 

®a{-1* w )®b{i* w ) = hm ®a{-1* w )®bv{i* w ) < 1. 
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This implies that 7^ < 7^, so that lim^oo 7^, = 7^ = -y w . This yields the desired lower limit. □ 

We continue by deriving an expression for the decay rate 7 P of the busy period P. Sufficient 
conditions for precise asymptotics of P{P > x}, which are of the form Cx~ 3 / 2 e~" /pX , are given in 
Palmowski & Rolski j!9j . These asymptotics follow from a detailed analysis, involving a change-of- 
measure argument. We show that logarithmic asymptotics (which are of course implied by precise 
asymptotics) can be given without any further assumptions. 

Proposition 2.2 As x — > 00, we have logP{P > x} ~ —J P x, with 

lp = sup{s - ¥(*)}, (2.2) 

and = 



Proof 

We first derive an upper bound. Let X{t) be the amount of work offered to the queue in the interval 
[0,t]. In Lemma 2.1 of Mandjes & Zwart it is shown that for each s > 0, 

*(s) = lim -logE{e sX(t) }. (2.3) 

t— >oo i 

Using the Chernoff bound, we have for all s > 0, 

P{P > t} < F{X(t) >t}< e -^+logK{exp{sX(t)}} _ 

Consequently, 

limsup -logP{P > t} < -s + limsup- log E{exp{sX(i)}} = -(s - *(s)). 

Minimizing over s yields the upper bound for P{P > t}. We now turn to the lower bound, for 
which we again use a truncation argument. First, note that 

P{P > x}> F{P y > x}. 

For truncated service times, the assumptions in for the exact asymptotics (cf. Equation (33) 
in |19j ) are satisfied, and we have, with obvious notation, 

liminf-logP{P > x} > lim - log F{P y > x} = - sup{s - V v (s)} = -7^. 

x— >oo x x-*co x s >0 

So to prove the theorem, it suffices to show that j y — > 7 p for y — > 00. Define /^(s) = s — ^(s). 
It is clear that f y (s) — > /(s) = s — Vl/(s) pointwise as y — » 00 and that f y (s) is decreasing in y. 
Consequently, we have that the limit of 7^ for y — > 00 exists and that 

7* = lim 7^ = lim sup/ y (s) > sup/(s) = j p . 

y^oo ^ y^oo S>Q S>Q 
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It remains to show that the reverse inequality holds. For this, we use an argument similar to one in 
the proof of Cramers theorem (cf. Dembo & Zeitouni -{J], p. 33). Take yo such that ¥{B y > A} > 
for y > uq. Then there exist 5, r] > such that ¥{B y — A > 5} > r/ > for y > y$. Hence, for 
V > Vo, 

*bv(s)*a(-«) = ^{e sBy }E{e~ sA } = E{e s ^ BV - A ^} > rje sS . 

For s large enough, we now have 
1 



Since is increasing in s, we find that for s and y large enough, 

"^(i^) *•+•?(•*<-•>) - - 

Since (J?^ 1 (l/$s»(s)) is decreasing in y and is continuous in s, we see that for y > yo the level 
sets L y = {s : f y (s) > 7*} are compact. Moreover, since f y (s) is decreasing in y, the level sets are 
nested with respect to y. Consequently, the intersection of the level sets L y contains at least one 
element, say sq. By the definition of sq, we have f y (so) > 7p for every y. Thus, since f y converges 
pointwise, 

7 P = sup/(s) > /(s ) = lim f y (s ) > 7*. 
s >o f-* 00 

We conclude that 7^ — > 7 P as y — > 00, which completes the proof. □ 



3 The GI/GI/1 priority queue 

In this section, we consider the following GI/GI/1 two-class priority queue. Customers arrive 
according to a renewal process with generic inter-arrival time A. An arriving customer is of class 
1 with probability p, in which case he has service time B\ . Customers of class 2 have service time 
B2. Class- 1 customers have priority over class-2 customers. We assume that < p < 1, and that 
pE{B\} + (1 — p)K{B2} < E{A}, which ensures that the priority queue is stable. We are interested 
in the steady-state waiting time W% of a class-2 customer, that is, the time a class-2 customer has 
to wait before he enters service for the first time. Note that W2 is independent of whether the 
priority mechanism is preemptive or not. 

Let N\(t) be the renewal process generated by the arrivals of the class-1 customers, i.e., N±(t) = 
max{n : A\ t \ + • • • + Ai^ n < t}. Here A\ t i is the time between the arrival of the (i — l)-st and i-th 
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customer. A generic class- 1 inter-arrival time is denoted by A±. Note that A\ is a geometric sum 
of "original" inter-arrival times A: 



**oo = E {e ^} = f; P (i - P r* A (sr+i = 1 _ ( ^ A ^ (a) - 



ra=0 

Define 

iVi(t) 

i=i 

Hence, -X"i(i) is the amount of work of type 1 that has arrived in the system by time t. Let Pi be 
a generic busy period of class 1 customers. Finally, let P\(x) be a busy period of class- 1 customers 
with an initial customer of size x, so 

Pi(x) = inf{* > : x + X^t) < t}. 

Denoting the total workload in the queue at arrivals again by W (cf. Section 2), we have the 
following fundamental identity: 

W 2 = Pi{W), (3.1) 

where W and {Pi(x),x > 0} are independent. This identity holds since, using a discrete-time 
version of PASTA, W is also the workload as seen by an arriving customer of class 2. Set 

• ,w ~**(*Sm)- (3 ' 2) 

The main result of this section is the following. 

Theorem 3.1 As x ^ oo, we have logP{W2 > %} ~ — 7w 2 %, with 

-y W2 = sup {s-*i(s)}. (3.3) 

se[0,-y w ] 

Before we give a proof of this theorem, we first describe some heuristics, starting from Wi = 
P\{W). The most likely way for W2 to become large (i.e., W2 > x) involves a combination of two 
events: (i) W is of the order ax for some constant a > 0; (ii) Pi (ax) is of the order x. Clearly, there 
is a trade-off: as a becomes larger, scenario (i) become less likely, while scenario (ii) becomes more 
likely. Thus, we need to find the optimal value of a. For this we need to know the large-deviations 
decay rates associated with events (i) and (ii) . The decay rate of event (i) is simply a^ w . To obtain 
the decay rate of event (ii), note that 

P{Pi(ax) > x} w P{Xi(x) > (1 - a)x}. 



7 



One can show that the RHS probability has decay rate sup s>0 {(l — a)s — \&i(s)}. Thus, the optimal 
value of a, and the decay rate j W2 , can be found by optimizing the expression 

inf {a^ w + sup[(l - a)s + $i(s)]}. 

It is possible to show that the value of this program coincides with sup s6 r 0)7 — \I/i(,s)}. Moreover, 
the optimal value of a is if the optimizing argument of s — ^i(s) is strictly less than 7^, and it is 
1 — ^'1(7^) if sup se r 0)7 i{s — ^(s)} = j w — ^1(7™). In the proof below, we only use these heuristics 
to "guess" the correct value of a. 

Note that the two cases a > and a = correspond to two qualitatively different scenarios 
leading to a large value of W 2 - If a = 0, then the customer sees a "normal" amount of work upon 
arrival, while a > results in a workload of the order ax at time 0. This distinction between two 
different scenarios is typical in priority queueing, see Abate & Whitt |J| and Mandjes & Van Uitert 
|16j for more discussion. 

Proof 

We start with the upper bound. Using the Chernoff bound, we find that for s > 0, 

F{W 2 >x} = P{Pi(W0 >x}< P{W + X 1 (x)-x>0} < E{e^}e- M E{e*( l! }. 
Using ()2.3j) with X(t) replaced by X%(t), we see that for all s € [0,7™), 

lim sup - log P{ W 2 > x} < - [s - *i (s)]. 

x 

The proof of the upper bound is completed by minimizing over s, and noting that sup se r 0)7w ){s — 
*x(s)} = su Ps6[0i7m] {s-^i(s)}. 

We now turn to the lower bound. From the proof of Proposition 12.21 we see that Pi has decay 
rate 7 Pl = sup s>0 {s — \&i(s)}. Let si be the unique optimizing argument. In addition, let r be the 
probability that at the arrival of a class-2 customer to the steady state queue at least one customer 
of type 1 is waiting. It is obvious that r > 0. Since Pi(W) > s t Pi on this event, we see that 

¥{Pi(W) > x}> rP{Pi > x}, 
which by implies that 

lim inf — logP{W 2 > x} > — j Pl . 
Thus, if s\ < 7iu, we can conclude from this and the upper bound that 
lim - log P{W 2 > x} = — 7„, . 

x^oo x 
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What remains is to consider the case si > 7to . Since the concave function s — ^i(s) is increasing 
between and si, we see that sup se [ 07uj ]{s — ^i(s)} = 7«, — x I / i(7 w ). Thus, to complete the proof 
of the theorem, it suffices to show that 

lim inf - log P{ W 2 > x} > - [ lw - ( 7w )]. (3.4) 

x^co x 

Note that for any a > 0, 

F{W 2 >x}> P{W > ax}P{Pi(ax) > x} = e- a7 » x+o(x) F{Pi(ax) > x}. (3.5) 
Combining H3.5|) and Lemma 13.21 below, we see that by taking a = 1 — ^ ,/ 1 ( 7u) ), 

logP{W 2 > x}> -a<y w x + o(x) + logP{Pi(ax) > x} = -x(j w - *i( 7ro )) + o(x), 
which coincides with (|3.4j) . as was required. □ 

We now provide the result that was quoted in the proof above. 
Lemma 3.2 Set a = 1 — ^[(-jw). If j w < s\, then 

logP{Pi(ax) >x}> -x(j w (l - a) - #i( 7 «,)) + o(x). 
Proof 

To prove the lemma we use a change-of- measure argument. Define a probability measure Pi,{-} for 
v > such that 

F v {A 1;i edx} = e-^t^PIAyGda:}/^!-*!^)), i > 1, 
¥ u {B lti G dx} = e ra P{P M € dx}/$ B », i > 1. 

Choose v = v e such that 

W (z^e) = — ^ — f = — — - / — ; — — = 1 — a + e, e < a. 

We denote this probability measure by P^j-}. The drift under this new measure is 1 — a + e, making 
the event {Pi (ax) > x} extremely likely for large x. Note that vq = j w , by the definition of a, and 
since ^(s) is strictly increasing. 

Let T n be the Borel cr-algebra generated by A^x, . . . , A\ >n , B^i, . . . , Pi >n . Define S^ 1 = Ai t i + 
. . . + Ai tTl and S^ 1 = B\ t \ + . . . + B\^ n . Note that N\{x) := N\(x) + 1 is a stopping time w.r.t. the 
filtration (.P n ). Furthermore, note that the event {Pi (ax) > x} is .T^^-measurable. Finally, note 
that for every e > small enough, the process 1/M^,n > 1, with 

M* = exp{*i(i/ e )S* -v £ S*}, 
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is a martingale w.r.t. T n under P{-}, since the definition of \&i ensures that <3?Ai (— ^\{yei)^B^ {ye) = 
1. Thus, we have the following fundamental identity (see for example Theorem XIII. 3. 2 in j2]): 

¥{P 1 (ax) >x} = E Ue {M e Ni{x) I(Px(ax) > x)}. 
Furthermore, we have for any event S C T^m, 

P{Pi(ax) >x}> E 1/£ {M^ i(x) /(P 1 (ax) > x)I(S)}. (3.6) 
Take here 

S^S £ :={s*l {x) <(l-a + e)x}. 

Note that <Sjvi(a;)+i > x by definition and apply the definition of S £ to obtain from (|3.6f) the following 
lower bound for ¥{Pi(ax) > x}: 

liminf -log¥{Pi(ax) > x} > -u £ {l - a + e) + *i(i/ £ ) + liminf - log {Pi (ax) > x,S e }. 

x^oo x x^oo x 

By the law of large numbers, we have that ¥ Ve {Pi(ax) > x;S e } is bounded away from zero, 
uniformly in x for every e > 0. Consequently, 

liminf - logP{Pi(aa;) > x} > -uJl - a + e) + ^\(v e ). 

x— >oo x 

Now let e 10. This yields 

liminf - log P{Pi (aa?) > x} > -j w (l - a) + #1(7™), 
and the statement of the lemma follows. □ 

From Theorem 13.11 we can deduce the decay rate of the sojourn time V2 of class-2 customers. 
This turns out to be the same for both preemptive and non-preemptive service. 

Theorem 3.3 As x — > oo ; we have logP{V2 > x} ~ — ^ W2 x, where ^y W2 is as in \'J. 

Proof 

For the non-preemptive case, we have V2 = W2 + P>2, where W2 and P2 are independent. Since 
the decay rate of P>2 is larger than ^ W2 , and since the decay rate of a sum of independent random 
variables is equal to the smallest decay rate (see for example [15] for a short proof), the result for 
this case follows immediately. In the preemptive case, we use that 

V 2 = P 1 (W + B 2 ) > st W 2 , 

which gives us the lower bound. The upper bound follows the same lines of the proof of Theorem 
IXTIand noting that E{e^ g2 } < cxd. □ 
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4 Shortest Remaining Processing Time 

In this section we present our results on the sojourn time under the SRPT discipline. Define Vsrpt 
as the steady-state sojourn time of a customer under the preemptive SRPT discipline. Further, 
define the right endpoint xb by xb = sup{x : ¥{B > x} > 0}. When it comes to determining the 
decay rate of Vsrpt, it turns out to be crucial whether 

F{B = x B } = 0, (4.1) 

or not. In the first subsection, we show that if (|4.1|) holds, then the decay rate of Vsrpt is equal to 
7 P , the decay rate of the busy period P. If 1)4.11) does not hold, the situation is more complicated. 
In that case we use the results of the previous section to show that the decay rate of Vsrpt is equal 
to 

7u>2> where W2 is the waiting time in a certain auxiliary priority queue. This is the subject of 
the second subsection. We also show that for the non-preemptive SRPT discipline the same results 
hold. 

4.1 No mass at the right endpoint 

In this section we prove the following theorem. 

Theorem 4.1 Suppose that F{B = xp} = 0. Then logF{VsppT > x} ~ — j p x for x — > 00, with 
7 p as in 

Proof 

Let Vsrpt be the sojourn time of a tagged customer with service time B. Since Vsrpt < P*, 
where P* is the residual busy period P(W), and since for light tails the decay rate of P* coincides 
with that of P (this follows from Lemma 3.2 in PQ), we see that 

limsup-logP{Ks7jPT > x} < -j p . 

£— >0O X 

Thus, it suffices to show that the corresponding result holds for the lower limit. For this, we 
construct a lower bound for F{Vsrpt > x}. Assume first that xp = 00. Let A be the last inter- 
arrival time before the tagged customer arrives, Bq be the service time of that customer, and a be 
such that P{A < a} > 0. Then, for all y, 

nVsRPT>x} > ¥{V S RPT>x;B>y,A<a,B <y} 

> F{A < a}F{B > y}F{B < y}F{P y ~ a > x}. 

The last inequality holds since conditional on A < a, B > y and -Bo < V, the tagged customer 
has to wait at least for the sub-busy period generated by the customer that arrived before him, 
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and this sub-busy period is stochastically larger than P v a . Since P{^4 < a}P{B > y} > 0, and 
^{Bo < y} > for y large enough, we have that 

liminf -logF{V S RPT > x} > - 7 ^ a 

x— >oo x 

for y large enough. Letting y — > xp = oo, we obtain jp a — > 7 p , as in the proof of Proposition 2.2. 

If xb < 00, the above proof can be modified in a straightforward way if ¥{A < a} > 
for all a > 0. However, this may not be the case in general and therefore we have to make a 
more involved construction. By definition of xb, there exists a decreasing sequence (e n ) such that 
¥{xb - e n < B < x B - e n /2} > for all n, and e n — > as n — » 00. Since P{-B > A} > 0, 
we can assume that £1 is such that P{^4 < xb — 2ei} > 0. Let R n be the event that the last 
[xb/zuI customers that arrived before the tagged customer had a service time in the interval 
[xb — £n,%B — £n/2], and that the last [xb/£u\ inter- arrival times were smaller than xb — 2e n . By 
definition of e n , we have ¥{R n } > for all n. 

Furthermore, by the SRPT priority rule, we see by induction that on the event R n , after 
the kth of the last n inter-arrival times, there is a customer with remaining service time larger 
than ke n . Hence, at the arrival of the tagged customer, there is a customer in the system with 
remaining service time in the interval [xb — £n> X B — £n/2]. If the tagged customer has service time 
B > xb — £n/2, his sojourn time satisfies Vsrpt > P Xs_e " on R n . Consequently, for all n € N, 

HVsrpt >x}> PiRnMB >x B - e n /2}P{P XB - £ " > x}. 

This implies 

liminf -log¥{ V S rpt > x} > -~f* B ~ £n . 

x^oo x 

Letting n — > 00, and hence e n { 0, we get jp B ~ £n — > j p , as before. This completes the proof. □ 

The property that the decay rate of the sojourn time is equal to that of the busy period is 
shared by a number of disciplines, see Section f5. 11 Further, we remark that for light tails, 7 P is the 
smallest possible decay rate for the sojourn time in the class of all work-conserving disciplines: the 
sojourn time is bounded above by the residual busy period P* , and for light-tailed service times 
P* has decay rate 7 P (cf. Lemma 3.2 in pP). 

4.2 Mass at right endpoint 

If there is mass at the right endpoint xb of the service-time distribution, then the tail behavior of 
Vsrpt is more complicated. To obtain the decay rate of Vsrpt for this case, we identify the SRPT 
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queue with the following two-class priority queue. Let the customers of class 1 be the customers 
with service time strictly less than xb- Then B2 = xb and B\ is such that 

P{-Bi < x} = ¥{B < x \ B < x B }, x>0. (4.2) 

Theorem 4.2 Suppose that ¥{B = xb} > 0. Then \og¥ {Vsrpt > x} ~ ~7v x for x — > 00, with 

7« = sup {a-*x(s)}, (4.3) 

se[o,7„] 

where is as in ^.V. ii)) . and Si is as in j4-2\ ). 
Proof 

First, note that if q = ¥{B = xb} = 1, we have a G/D/l SRPT queue, which has the same 
dynamics as a FIFO queue. Indeed we obtain \Pi = 0, implying j v = j w , cf. Proposition 2.1. 
Assume therefore that < g < 1, let Vs^py be the sojourn time of a tagged customer with service 
time B, and write 

nVsRPT >x} = q¥{V SR PT > x \ B = x B } + (1 ~ q)¥{V SR PT >x\B< x B }- 
From the nature of the SRPT discipline, or a simple coupling argument, it is obvious that 

nVsRPT >x\B<x B }< ¥{V S rpt >x\B = x b }. 
Therefore, it suffices to consider the tail behavior of Vsrpt, where 

nVsRPT <x} = ¥{V S rpt <x\B = x B }. 

First, we note that Vsrpt is bounded from below by the time it takes until our tagged customer 
receives service for the first time. A crucial observation is that this period coincides with the 
low priority waiting-time W2 defined in Section 3. Second, note that Vsrpt is upper bounded by 
the sojourn time Vi of a class-2 customer in the above priority queue. Hence, we have W% < s t 
Vsrpt <st V 2 - 

Further, Vz satisfies V2 = P\{W + xp) for preemptive service; for non-preemptive service, we 
have V 2 = Pi(W) + xb- Since the logarithmic asymptotics of W + xb coincide with those of W, we 
can mimic the proof of Theorem 13.11 to see that in both cases the decay rate of V 2 coincides with 
that of W 2 . Hence the decay rate of Vsrpt is given by ()3.3|) . and the proof is completed. □ 

The intuition of how Vsrpt becomes large is the same as that of W 2 in Section 3. In Section 
15.11 below, we show that if there is mass in the endpoint xb, then the decay rate of the sojourn 
time lies strictly between the maximal value (obtained for FIFO) and the minimal value (LIFO). 
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5 Complements 



In the previous two sections we have derived expressions for the decay rates j m2 and j v . In this 
section we derive some properties of these decay rates. Specifically, in Section l5~Tl we compare *y W2 
and 7„ with 7^ and j p . We show that for q = F{B = xb} € (0, 1), we always have j p < j W2 < 7^. 
Consequently, if q G (0, 1), we also find that j v = ^ v {q) € ("fp, Jw)- As explained in the introduction, 
this is a non-standard result. We also indicate that j v can take any value between j p and j w , 
depending on the value of q. 

Further, in Section [5.21 we specialize our expression of j W2 to the case of Poisson arrivals. For 
priority queues, a quite involved expression for the decay rate was given in pQ. We show that this 
expression can be simplified, and that it coincides with our expression of j W2 . 

Finally, in Section 15.31 we derive heavy-traffic approximations for ^ W2 . 

5.1 Comparison with other service disciplines 

In this subsection, we compare the decay rates j m and j v with the decay rate of the sojourn time 
under FIFO and LIFO, which respectively equal 7^ and j p . 

We first show that for the priority queue described in Section|21 the decay rate of W2 is different 
from those of P and W. 

Proposition 5.1 Assume < p < 1. Then j p < ~f W2 < 7^. 
Proof 

Since x I / i(s) > for s > 0, we have by Theorem 13. II that 

j W2 = sup {s-^i(s)}< sup S = 7u,. 
se[0,7„] se[0,-y w ] 

To prove the inequality j m > 7-, we provide a different construction of the function ^i(s). Let 
Bp be a service time which is equal to B\ with probability p and with probability 1 — p. It is 
clear that < 3>b(s). The amount of work X\(t) generated by class-1 customers between 

time and t is the same in distribution as the amount of work generated by the arrival process 
with inter-arrival times A and service times B p . We thus get that ^i(s) = — ^^(I/^.b (s)). Since 
&a(s) is strictly increasing in s, so is its inverse Combining this with 3>b (s) < &b(s) leads 

to the conclusion that x I'i(s) < ^(s). Recall that the residual busy period P* satisfies P* = P(W); 
its decay rate is given by sup^o^]! 5 ~~ ^(s)}' as can be seen by mimicking the proof of Theorem 
3.1. Hence, 

j W2 = sup {s-^i(s)}> sup {s - *(s)} = 7 P *- (5-1) 
sg[o,-j w ] se[o,j w ] 
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The proof is completed by recalling that for light-tailed service times, "y p = j p * 



□ 



From Theorem 14.21 and Proposition 15. II we conclude a similar result for the SRPT discipline. 
Corollary 5.2 If < ¥{B = xb} < 1, then j p < -y v < j w . 

Hence, if there is mass in the endpoint xb, then the decay rate of the sojourn time under SRPT 
lies strictly between those under LIFO and FIFO. 

The following consequence of Theorem 14.21 indicates that in some sense, all values between 
those of LIFO and FIFO are assumed. Let F q be the mixture of a distribution bounded by c, and 
a distribution with all mass in c, such that ¥{B = c} = q. Assume that c < E{^4}, and let J v (q) 
denote the decay rate of the sojourn time in a queue with service-time distribution F q . 

Proposition 5.3 The decay rate J v (q) is continuous in q. In particular, it increases from 7 P (0) 
to 7to(1)? and assumes all values in between. 



By Theorem 14.21 it is enough to show that Jwifl) an d are continuous in q. Since ^b± is 

constant in q, and &a 1 is continuous in q, also ^i(s) is continuous in q. Furthermore, since <3?a is 
constant in q, <&b is continuous in q, and Jwiq) is finite for all q, Proposition 12.21 implies that Jwip) 



Tabled below shows the decay rates of the sojourn time that are known in the literature. It 
turns out that the property in Corollary 15.21 is quite special, since almost all other known decay 
rates are either minimal or maximal. 

5.2 Poisson arrivals 

Consider the priority queue of Section 3, with the additional assumption that A has an exponential 
distribution with rate A. Letting Ai = p\ and A2 = (1 — p)A, we get that ^i(s) = Ai( < 1 > b 1 (s) — 1). 
Thus, we have 



Proof 



is continuous in q, and the proof is completed. 



□ 



7 W2 = sup {s - Ai($Bi(s) - 1)}. 



(5.2) 



se[0,-y w ] 



Suppose that 1 — Ai^ 



(,1w) > 0. Then the maximum value is attained in 7^ and we have 



7w 2 =lw~ Ai($B 1 (7 tu ) - 1). 



(5.3) 
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decay rate 


discipline 


condition 


queue 


7 P (minimal), (|2.1j) 


LCFS pu 




GI/GI/1 




FB JT5] 




M/GI/1 




PS [T7j 


Vc > : logPji? > clogrc} = o[x) 


GI/GI/1 




ROS [HI 




M/M/l 




SRPT here 


P{B = x B } = 


GI/GI/1 


7 (in between) 


PS pm 




M/D/l 


7„ (in between), (|4.3j) 


SRPT here 


< F{B = x B } < 1 


GI/GI/1 


7^ (maximal), (|2.2() 


FCFS HO] 




GI/GI/1 




SRPT 


P{£ = x B } = 1 


GI/D/1 



Table 1: The decay rate of the sojourn time under several disciplines for light-tailed service times. 



This expression is rather explicit, as 7^ is the positive solution of the equation 

7w = A($ B ( 7w ) - 1). (5.4) 

The goal of this subsection is to show that in the case of Poisson arrivals, our expression Q5.3JI 
coincides with the expression of j W2 given by Abate & Whitt 1 . Assuming that E{l?i} = 1, it is 
shown in P, p. 18, that — ^ W2 is the solution of f{s) = 1/p, with 

f(s) = — ^ — ho\s) H T — 92e(zi(s)), zi(s) = s + Ai - Xih(s), 

Pl+ P2 Pl+ P2 

x 1 - h(s) 1 - hjs) A l-Si(a) 
^0 ( s ) = TTT = — > 92e(s) - 



s + p 1 -p 1 b 1 {s) z 1 (s) sg 2 i 
where b\(s) is the LST of the M/G/l busy period, g 2 (s) = <1?b 2 (— s) an d 321 = lE{i?2}- 

Our expression of *y W2 seems preferable, although we hasten to add that the form provided by 
is more convenient when considering the more complicated task of obtaining precise asymptotics, 
as is done in pQ. 

We now simplify the description of j W2 in fj] . Since p = p\ + p 2 , we have 

1 1 ; n Pi l-bi(s) . P2 l-fl2(*i(s)) 
- ~ /(«) = ; r^— + 



P1+P2 P P1 + P2 zi(s) pi + p 2 z 1 (s)E{B 2 } 

Hence, 

Zl (s) = P1 [1- b^s)] + ^^[1 - S2(*i00)]. 
Consequently, 

s = A 2 [l - S2(«i(a))] = A 2 (l - $ Ba (-*i(*)). 
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Since A<3? B (,s) = Ai<I> Bl (s) + A2 < £ B2 (s), we can rewrite this into 



s + Ai[l - * Bl (-zi(a))] = A[l - * B (-zi(«))]. 



(5.5) 



Since the LST of the busy period satisfies the fixed point equation 



= $ Bl (-2l(s)) 



(5.6) 



we can rewrite ([5 .5(1 as 



s + Ai(l-&i(s)) = A[l-* B (-2i00)] 



and thus, using the definition of Z\ (s) 



*i (a) = A[l-* B (-*i (*))]■ 



Using the definition of Tu, in (|5.4|1 . we see that ^ W2 is the solution of 7^ = — Zi(s). We now give 
an alternative expression for — z±(s). An alternative expression for the busy period transform was 
found by Rosenkrantz [2^: defining </>(s) = Ai(l — $ Bl (— s)) — s, it holds that 



Since $Sj(s) is strictly increasing, it follows from (|5.6|) and 1)5. 7j) that 21 (s) = 1 (s). Hence, 7 W2 
is the solution of "y w = —cp~ 1 (s), and thus we obtain 

lw 2 = 4>{-lw) = lw- Ai($ Bl (7 w ) - 1), 
which is indeed equal to our expression ()5.3|) , 

To conclude this section, we remark that for the M/G/l queue, the decay rate 7„ can take on 
a simple form. Suppose that 1 — \i& Bl {~f w ) > 0, and that < ¥{B = xb} < 1. Then by Theorem 
14.21 and the expression for j w given in (|5.4|) . we have 

lv = Iw ~ Ai($ Bl (7 w ) - 1) = A(* B (7«0 - 1) - Ai($ Bl (7 1u ) - 1) 
= A 2 ($ B2 (7») - 1) = AP{5 = xbW*** ~ 1). 

5.3 Heavy traffic 

We now examine the behavior of the decay rate ^ v of the SRPT sojourn time in heavy traffic. The 
aim of this section is to show that the behavior of this decay rate critically depends upon whether 
¥{B = xb} > or not. If ¥{B = xb} = 0, then j v = ^ p by Theorem 14.11 The results in Section 



b 1 ( S ) = <S> Bl (-<p- 1 (s)). 



(5.7) 



17 



4.2 of \n\ then imply that "f v ~ C(l — p) 2 for some constant C. We now show that a fundamentally 
different behavior applies if P{B = xb} > 0. 

Since, in this case, we have a relationship with the GI/GI/1 priority queue, we consider first 
the setting of Section 3. We let the service time B<i increase in such a way that p — ► 1. Specifically, 
we consider a sequence of systems indexed by r, such that p, A\, A2 and B\ are all fixed, and that 
£?2 = -62(0 is such that the traffic load satisfies p r = 1 — 1/r. Let j w (p r ) denote the decay rate of 
the workload in such a queue. 

If we let o\ < 00 be the variance of A and assume that the variance of B{r) = B\ + B2(r) 
converges to o 2 B , then it holds that for r — > 00 (cf. Corollary 3 of 

lw{Pr) ~ K(l — p r ), (5.8) 

with K = 2/{a 2 A + ct 2 ^). In particular, r y w {p r ) I 0. Consequently, if r is large enough, we always 
have r y W2 (p r ) = Jw(Pr) — ^i(lw(Pr)) by Theorem 13.11 Since VPi(s) ~ /o(l)s as s J, 0, where p(l) is 
the load in the high priority queue, we obtain the following heavy-traffic result for j W2 . 

Proposition 5.4 For p — > 1 as described above, we have 

7 ^~^(l-p(l))(l-p). 

Thus, also 7,; is of the order (1 — p) if ¥{B = xb} > 0. This behavior is notably different from 
the (1 — p) 2 behavior of j p . 

6 Conditional sojourn times 

Our results in Section 4 and 5 show that the decay rate j v for SRPT is smaller than 7^, which is 
the decay rate of the waiting (and sojourn) time under FIFO. Thus, one could say that according 
to this performance measure, SRPT is worse than FIFO. 

The reason that the sojourn-time decay rate under SRPT is small is apparent when taking a 
closer look at the proof in Section 14. It the sojourn time of a customer with a (very) large service 
time looks like a residual busy period. However, smaller customers may have a much shorter sojourn 
time. In fact, for the conditional sojourn time Vsrpt{d) = \Ysrpt \ B = y] under the preemptive 
SRPT discipline, the following proposition holds. 

Proposition 6.1 IfP{B = y} = 0, then logP{VsRPT(y) > %} ~ — lp% as x — > 00. 
Proof 

For the lower bound, we remark that Vsrpt{u) is stochastically larger than the residual busy 
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period P* y in the queue with service time B y . This residual busy period has decay rate 7$. For the 
upper bound, we consider an alternative queue with generic service time B y , stationary workload 
at arrival instants W v and busy period P y . Now observe that in the original queue, at any point 
in time, at most one customer with original service time larger than y has remaining service time 
smaller han y. Hence, we can bound 

V S RP T (y) <st P y (W y + y + y), 

where P y (x) is a busy period in the alternative queue starting with an exceptional customer of 
length x. Applying the Chernoff bound, and arguing like in the proof of Proposition 12.21 we find 

limsup - logF{V SR PT{y) >t}<- sup {s - *»(s)} = -7^, 
t-»oo t s6[0,7 tt y] 

where the last equality follows from Q5.1j) . The upper bound follows from noting that P y and P* y 
have the same decay rate, and the proof is completed. □ 

Suppose that B has a density, so that jp is continuous in y. Then the function 7^ strictly 
decreases in y, and converges to 7 P > -y w as y — > 00. Further, — ► 00 as y — > 0, since *ff y (s) — > 
as y — > 0. Hence, there exists a critical value y* for which 7p = ~f w . Thus, when the decay rate 
is used as a performance measure, one could say that FIFO is a better discipline than SRPT for 
customers of size larger than y*; the fraction of customers that suffer from a change from FIFO to 
SRPT is ¥{B > y*}. We now describe the behavior of y* as a function of p for p — > 1 and p —* 0. 

Proposition 6.2 Lei y* = sup{y : 7p > 7«,}. If p 1, then y* — » x^. 

Proof 

Let y < ib be fixed, and let 7p(p) be the decay rate of P y as a function of p, and define ^ w {p) 
similarly. Since P y is a busy period in a stable queue, even when p = 1 in the original queue, we 
have 7p(p) > 7p(l) > for all p < 1. By (|5.8|) . we have for p large enough, 

l w (p)<l y p(l)<ll{p). 

Hence, for p large enough, y* > y. Since y < xb was arbitrary, the proof is completed. □ 
Proposition 6.3 If the service time B has decay rate 75 G (0, 00), i/ien y* — > 00 /or p — > 0. 
Proof 

Let p — > by setting the generic inter-arrival time equal to rA and letting r — > 00. Since $ rj 4(x) — > 
for all x < 0, we have ^"^(x) — > for all < x < 1. Hence, for all y, 

7 >)=sup{ s + <I>-( * )}^oo, ,^0. (6.9) 
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The workload does not depend on the discipline as long as the discipline is work-conserving. Further, 
conditioned on it being positive, the workload under FIFO is stochastically larger than a residual 
service time, which for light-tailed distributions has the same decay rate as B. Hence, we have 
lw{p) < 7b < oo for all p. It then follows from (|6.9|) that ^w(p) > 'y-w(p) eventually as p — > for all 
y, and we can conclude that y* — > oo as p — > 0. □ 

6.1 Numerical example 

As an illustration, we compute y* and ¥{B > y*} for the M/M/l queue with E{£>} = 1 and arrival 
rate A (so that p = A). Figure Q shows the probabilities F{B > y*} for various values of p. 



P{B > y*{p)} 



+ ++ + 



Figure 1: The probabilities ¥{B > y*(p)} for p G (0, 1) in the M/M/l queue. 

From the figure, it is clear that y* becomes very large under low and high loads. But even for 
moderate values of p it is clear that about 85 percent of the customers would prefer (from a large- 
deviations point of view) SRPT over FIFO. 

7 Conclusions 

To conclude the paper, we summarize our results. For the GI/GI/1 queue with light-tailed service 
times, we obtained expressions for the logarithmic decay rate of the tail of the workload, the busy 
period, the waiting time and sojourn time of low-priority customers in a priority queue, and the 
sojourn time under the (preemptive and non-preemptive) SRPT discipline. 

For the sojourn time under SRPT, it turns out that there are three different regimes, namely 
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for service times with no mass, with some mass and with all mass in the endpoint of the service-time 
distribution. In the first case the decay rate is minimal among all work-conserving disciplines, in 
the last case it is maximal, but if there is some mass in the endpoint, then the decay rate lies strictly 
in between these two. The large-deviations results for the unconditional sojourn times suggest that 
a switch from FIFO to SRPT is not advisable. The results in Section 6 show that this suggestion 
is only valid for very large service times: in the M/M/l queue, at least about 85 percent of the 
customers would benefit from a change from FIFO to SRPT. 

There are several topics that are interesting for further research. First of all, large deviations 
for the queue length under SRPT are not well understood. A second problem is to obtain precise 
asymptotics for the tail behavior of the low-priority waiting time, or perhaps even the sojourn 
time. Finally, it would be interesting to compare conditional sojourn times of FIFO and PS from a 
large-deviations point of view. It is not clear to us which discipline performs better, and what the 
influence of the job size might be. 
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